CDS

Accession Number TCMCG020C07916
gbkey CDS
Protein Id RAL50242.1
Location 2966512..2967186
Organism Cuscuta australis
locus_tag DM860_007916

Protein

Length 224aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA394036, BioSample:SAMN07347267
db_source NQVE01000067.1
Definition hypothetical protein DM860_007916 [Cuscuta australis]
Locus_tag DM860_007916

EGGNOG-MAPPER Annotation

COG_category S
Description Ulp1 protease family, C-terminal catalytic domain
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko01002        [VIEW IN KEGG]
ko04121        [VIEW IN KEGG]
KEGG_ko ko:K08597        [VIEW IN KEGG]
EC 3.4.22.68        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway -
GOs GO:0003674        [VIEW IN EMBL-EBI]
GO:0003824        [VIEW IN EMBL-EBI]
GO:0006508        [VIEW IN EMBL-EBI]
GO:0006807        [VIEW IN EMBL-EBI]
GO:0008150        [VIEW IN EMBL-EBI]
GO:0008152        [VIEW IN EMBL-EBI]
GO:0008233        [VIEW IN EMBL-EBI]
GO:0008234        [VIEW IN EMBL-EBI]
GO:0016787        [VIEW IN EMBL-EBI]
GO:0019538        [VIEW IN EMBL-EBI]
GO:0019783        [VIEW IN EMBL-EBI]
GO:0019784        [VIEW IN EMBL-EBI]
GO:0043170        [VIEW IN EMBL-EBI]
GO:0044238        [VIEW IN EMBL-EBI]
GO:0070011        [VIEW IN EMBL-EBI]
GO:0071704        [VIEW IN EMBL-EBI]
GO:0140096        [VIEW IN EMBL-EBI]
GO:1901564        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGTCTTCAAAAGCCAATGACAAGATTCTCAGCTACAATGATGTTGTACTAAGGCGCTCGGATCTTGACATTCTTAGCGGACCATATTTTCTAAACGACCGAATAATCGAGTTCTATTTCAGTTTTCTCGCCTCGAGATTCCCATCTGAGGATGTTTTACTGTTGTCCCCGTCGATCACTTTCTGGATCAAAGAGTGCAGAGACACTACAATGCTTAAGGATTTCATAGAACCCCTCTGTCTATCTCAAAGGAAATTAATCATCTTCCCAATCAATGACAACTCGGACGTGGATTTAGCCGAAGGGGGAAGCCATTGGAGTTTACTTGCTTTTGAGAGGGGCTCCAATGTGTTTGTCCATCACGATTCCATCTCGGGCTGCATCAACAAGAATGACGCCAGACATGTCTACGAAGCCGTTCTCCCTTTTACAGCATCTGGAATGGCTACTTATGTTGATTACTCAGGGACACCGAAGCAAGAAAACTGGTATGATTGTGGGGTATATGTCCTTTCCTTTGCAAGGGTCATTTGTGATTGGTATGGAAGCAGAGGACCGGAGGAAGGAGTTGATCTGTGGTTTCCCTCCTTGAAGGAACAGATAAATGCAGGTGCTGTTTCGGAGATGCGCGACGAGATTCGAAGGTTAATTGTTGATCTAATGGCGAGGAAGTAA
Protein:  
MSSKANDKILSYNDVVLRRSDLDILSGPYFLNDRIIEFYFSFLASRFPSEDVLLLSPSITFWIKECRDTTMLKDFIEPLCLSQRKLIIFPINDNSDVDLAEGGSHWSLLAFERGSNVFVHHDSISGCINKNDARHVYEAVLPFTASGMATYVDYSGTPKQENWYDCGVYVLSFARVICDWYGSRGPEEGVDLWFPSLKEQINAGAVSEMRDEIRRLIVDLMARK